Picture for Yansong Tang

Yansong Tang

CoSTL: Comprehensive Spatial-Temporal Representation Learning for Moment Retrieval and Highlight Detection

Add code
May 31, 2026
Viaarxiv icon

Boosting Zero-Shot 3D Style Transfer with 2D Pre-trained Priors

Add code
May 28, 2026
Viaarxiv icon

SAM3D-Phys: Towards Multi-Object Interactive Simulation in Real World

Add code
May 28, 2026
Viaarxiv icon

SAFE-Pruner: Semantic Attention-Guided Future-Aware Token Pruning for Efficient Vision-Language-Action Manipulation

Add code
May 28, 2026
Viaarxiv icon

DisDop: Distillation with Domain Priors for Open-Vocabulary Aerial Object Detection

Add code
May 23, 2026
Viaarxiv icon

FDDet: Achieving Data-Efficient Food Defect Detection Under Real-World Scenarios

Add code
May 23, 2026
Viaarxiv icon

FoodMonitor: Benchmarking MLLMs for Explainable Compliance Analysis

Add code
May 23, 2026
Viaarxiv icon

Segment Anything with Motion, Geometry, and Semantic Adaptation for Complex Nonlinear Visual Object Tracking

Add code
May 21, 2026
Viaarxiv icon

StableVLA: Towards Robust Vision-Language-Action Models without Extra Data

Add code
May 18, 2026
Viaarxiv icon

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

Add code
Apr 27, 2026
Viaarxiv icon